Exploration server for computer science research in Lorraine

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Scan-to-XML for Vector Graphics: an experimental setup for intelligent browsable document generation

Internal identifier: 008F43 (Main/Exploration); previous: 008F42; next: 008F44

Authors: Bart Lamiroy; Laurent Najman; Romain Ehrhard; Céline Louis; Franck Quélain; Nicolas Rouyer; Nabil Zeghache

RBID: CRIN:lamiroy01a

English descriptors: automated generation; component algebra; document analysis; hyperlink; xml

Abstract

This paper describes an experimental setup, carried out in collaboration with the ISA research group of the LORIA laboratory, Océ-PLT, and students from the École des Mines de Nancy. The main objective is to experiment with an approach for building a high-level document analysis platform by composing existing building blocks from a comprehensive library of state-of-the-art algorithms. The test case for this methodology is a fully automated method for generating a browsable, hyperlinked document from a simple scanned image. We concentrated our work on cutaway diagrams. These documents have the advantage of simple browsing semantics: they consist of a clearly identifiable legend containing index references, plus a drawing containing one or more occurrences of the same indices. The setup described in this paper starts from a raw binary image of a cutaway diagram and delivers an XML description matching the references in the legend with the indices in the image, together with a browser for interpreting the generated XML map. The complete document processing pipeline is designed within a combined scripting and compiled-library environment.
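
To make the idea of an XML description that matches legend references with indices in the drawing more concrete, here is a minimal sketch in Python. It is not the format used by the authors: the element names (`cutaway`, `entry`, `occurrence`), the bounding-box attributes, and the sample labels are all hypothetical and chosen only for illustration.

```python
import xml.etree.ElementTree as ET


def build_map(legend, occurrences):
    """Build a hypothetical XML map for a cutaway diagram.

    legend: dict mapping an index number to its legend label.
    occurrences: list of (index, x, y, width, height) boxes where an
        index was found in the drawing.
    """
    root = ET.Element("cutaway")  # hypothetical root element name
    for index, label in sorted(legend.items()):
        entry = ET.SubElement(root, "entry", index=str(index), label=label)
        # Attach every occurrence of this index found in the drawing.
        for occ_index, x, y, w, h in occurrences:
            if occ_index == index:
                ET.SubElement(
                    entry, "occurrence",
                    x=str(x), y=str(y), width=str(w), height=str(h),
                )
    return root


# Invented sample data, for illustration only.
SAMPLE_LEGEND = {1: "Piston", 2: "Valve"}
SAMPLE_OCCURRENCES = [(1, 10, 20, 8, 8), (2, 50, 60, 8, 8), (1, 90, 15, 8, 8)]

if __name__ == "__main__":
    tree = build_map(SAMPLE_LEGEND, SAMPLE_OCCURRENCES)
    print(ET.tostring(tree, encoding="unicode"))
```

A browser for such a map would only need to look up the `entry` for a clicked index and highlight its `occurrence` boxes, which is why a legend-plus-indices document lends itself to simple browsing semantics.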


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" wicri:score="416">Scan-to-XML for Vector Graphics : an experimental setup for intelligent browsable document generation</title>
</titleStmt>
<publicationStmt>
<idno type="RBID">CRIN:lamiroy01a</idno>
<date when="2001" year="2001">2001</date>
<idno type="wicri:Area/Crin/Corpus">002E15</idno>
<idno type="wicri:Area/Crin/Curation">002E15</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Curation">002E15</idno>
<idno type="wicri:Area/Crin/Checkpoint">001559</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Checkpoint">001559</idno>
<idno type="wicri:Area/Main/Merge">009464</idno>
<idno type="wicri:Area/Main/Curation">008F43</idno>
<idno type="wicri:Area/Main/Exploration">008F43</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Scan-to-XML for Vector Graphics : an experimental setup for intelligent browsable document generation</title>
<author>
<name sortKey="Lamiroy, Bart" sort="Lamiroy, Bart" uniqKey="Lamiroy B" first="Bart" last="Lamiroy">Bart Lamiroy</name>
</author>
<author>
<name sortKey="Najman, Laurent" sort="Najman, Laurent" uniqKey="Najman L" first="Laurent" last="Najman">Laurent Najman</name>
</author>
<author>
<name sortKey="Ehrhard, Romain" sort="Ehrhard, Romain" uniqKey="Ehrhard R" first="Romain" last="Ehrhard">Romain Ehrhard</name>
</author>
<author>
<name sortKey="Louis, Celine" sort="Louis, Celine" uniqKey="Louis C" first="Céline" last="Louis">Céline Louis</name>
</author>
<author>
<name sortKey="Quelain, Franck" sort="Quelain, Franck" uniqKey="Quelain F" first="Franck" last="Quélain">Franck Quélain</name>
</author>
<author>
<name sortKey="Rouyer, Nicolas" sort="Rouyer, Nicolas" uniqKey="Rouyer N" first="Nicolas" last="Rouyer">Nicolas Rouyer</name>
</author>
<author>
<name sortKey="Zeghache, Nabil" sort="Zeghache, Nabil" uniqKey="Zeghache N" first="Nabil" last="Zeghache">Nabil Zeghache</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>automated generation</term>
<term>component algebra</term>
<term>document analysis</term>
<term>hyperlink</term>
<term>xml</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en" wicri:score="3902">This paper describes an experimental setup, carried out in collaboration with the ISA research group of the LORIA laboratory, Océ-PLT, and students from the École des Mines de Nancy. The main objective is to experiment with an approach for building a high-level document analysis platform by composing existing building blocks from a comprehensive library of state-of-the-art algorithms. The test case for this methodology is a fully automated method for generating a browsable, hyperlinked document from a simple scanned image. We concentrated our work on cutaway diagrams. These documents have the advantage of simple browsing semantics: they consist of a clearly identifiable legend containing index references, plus a drawing containing one or more occurrences of the same indices. The setup described in this paper starts from a raw binary image of a cutaway diagram and delivers an XML description matching the references in the legend with the indices in the image, together with a browser for interpreting the generated XML map. The complete document processing pipeline is designed within a combined scripting and compiled-library environment.</div>
</front>
</TEI>
<affiliations>
<list></list>
<tree>
<noCountry>
<name sortKey="Ehrhard, Romain" sort="Ehrhard, Romain" uniqKey="Ehrhard R" first="Romain" last="Ehrhard">Romain Ehrhard</name>
<name sortKey="Lamiroy, Bart" sort="Lamiroy, Bart" uniqKey="Lamiroy B" first="Bart" last="Lamiroy">Bart Lamiroy</name>
<name sortKey="Louis, Celine" sort="Louis, Celine" uniqKey="Louis C" first="Céline" last="Louis">Céline Louis</name>
<name sortKey="Najman, Laurent" sort="Najman, Laurent" uniqKey="Najman L" first="Laurent" last="Najman">Laurent Najman</name>
<name sortKey="Quelain, Franck" sort="Quelain, Franck" uniqKey="Quelain F" first="Franck" last="Quélain">Franck Quélain</name>
<name sortKey="Rouyer, Nicolas" sort="Rouyer, Nicolas" uniqKey="Rouyer N" first="Nicolas" last="Rouyer">Nicolas Rouyer</name>
<name sortKey="Zeghache, Nabil" sort="Zeghache, Nabil" uniqKey="Zeghache N" first="Nabil" last="Zeghache">Nabil Zeghache</name>
</noCountry>
</tree>
</affiliations>
</record>
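
As a rough illustration of how a record like the one above could be read programmatically, the following Python sketch extracts the title, year, authors, and keywords with the standard library. One practical point: the record uses the `wicri:` prefix without declaring it, so a strict XML parser rejects it as written. The sketch works around this by wrapping the record in an element that binds the prefix to a placeholder URI. That URI, the `wrapper` element, the `parse_record` function, and the abridged sample are assumptions made for this example, not part of Dilib or Wicri.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URI (an assumption): the record does not declare wicri:.
WICRI_NS = "http://example.org/wicri"


def parse_record(xml_text):
    """Return title, year, authors and keywords from a record like the one above."""
    # Bind the undeclared wicri: prefix on a wrapper so the parser accepts it.
    wrapped = '<wrapper xmlns:wicri="%s">%s</wrapper>' % (WICRI_NS, xml_text)
    root = ET.fromstring(wrapped)
    analytic = root.find(".//analytic")
    return {
        "title": analytic.findtext("title"),
        "year": root.find(".//publicationStmt/date").get("when"),
        "authors": [name.text for name in analytic.findall("author/name")],
        "keywords": [term.text for term in root.findall(".//keywords/term")],
    }


# Abridged copy of the record (shortened title, two authors, two keywords).
SAMPLE = """<record><TEI><teiHeader><fileDesc>
<titleStmt><title xml:lang="en" wicri:score="416">Scan-to-XML for Vector Graphics</title></titleStmt>
<publicationStmt><idno type="RBID">CRIN:lamiroy01a</idno>
<date when="2001" year="2001">2001</date></publicationStmt>
<sourceDesc><biblStruct><analytic>
<title xml:lang="en">Scan-to-XML for Vector Graphics</title>
<author><name sortKey="Lamiroy, Bart">Bart Lamiroy</name></author>
<author><name sortKey="Najman, Laurent">Laurent Najman</name></author>
</analytic></biblStruct></sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en">
<term>document analysis</term><term>xml</term>
</keywords></textClass></profileDesc>
</teiHeader></TEI></record>"""

if __name__ == "__main__":
    print(parse_record(SAMPLE))
```

Wrapping the text rather than editing it leaves the record itself untouched, which matters when the same data is also consumed by other tools.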

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 008F43 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 008F43 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     CRIN:lamiroy01a
   |texte=   Scan-to-XML for Vector Graphics : an experimental setup for intelligent browsable document generation
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022